Automated service level management system

ABSTRACT

A method for service level management comprises identifying connected enterprise application components and, under control of an automated system, relating historical performance for the connected enterprise application components and electronically creating a service level agreement based on the historical performance relation.

BACKGROUND

Service Level Agreements (SLAs) are used in companies in many fieldsincluding financials services, natural resources such as oil and gas,and others. Service Level Management (SLM) has developed for managingcreation and maintenance of SLAs. SLM formalizes the service agreementprocess between business service customers and Information Technology(IT). SLAs originally applied to agreements relating to equipment. Morerecently, SLAs have evolved toward agreements relating to overridingbusiness services.

Service providers such Information Technology (IT) providers benefitfrom agreements with customers regarding quality of service to avoiddisputes, allocate resources, and manage costs. However, adoption ofSLAs has been limited both in terms of the number of companiesimplementing service-based service level agreements and the percentageof services that companies govern by service level agreements.

Adoption of Service Level Management (SLM) and Service Level Agreements(SLAs) has been limited by the extensive resource allocation forcreating SLAs for the large number, for example hundreds or thousands,of business services that may be covered by a single provider.

Using conventional manual techniques, implementation of agreementsacross an entire mix of business services can cost companies millions ofdollars. Once a service is included in a service catalog, lower impactand higher impact business services are treated the same so that evenvery low impact business services use a disproportionate amount ofresources due to lack of formal prioritization or accountability.Service Level Management was created to solve the resource allocationproblem. However, implementation costs continue to limit adoption eventhough SLAs have the potential for managing operational demand of IT.

Conventionally, SLAs are created by using Service Level Managementsoftware that allows Information Technology (IT) to formally captureagreements between the customer and IT. The software manually recordsthe service level criteria developed by service level manager workingwith a customer so that a substantial labor cost is incurred in creatingSLAs. SLAs are created on a case by case basis so that realisticestimates of service quality and budgeting for individual service levelagreements are difficult.

In conventional operation, a Service Level Manager (SLM) typicallystarts a SLA process by collecting service level delivery informationfor several months and then using the information for negotiationbetween the customer and an Information Technology (IT) supplier. TheSLM can also review usage trending with capacity and availability todevelop an equipment plan for a future agreement period. The informationis used by the Service Level Manager to negotiate terms with a customer.Once an agreement is in place, the Service Level Manager can report backon the performance of IT for the agreement on apercentage-of-availability delivered during defined hours of operation.

Conventional SLM systems involve high personnel costs for SLA creation.Also, because SLAs are created individually and rarely, overalltradeoffs and costs for multiple business services are not collected.SLAs are commonly created without considering possible alternatives toaspects of business services delivery and resource costs of the variousalternatives. SLAs are conventionally created without relating qualityof service and cost due to lack of information relating variousservices. A further difficulty with conventional SLM is that ITorganizations and business services typically do not have a commondefinition of the affect of actual service delivered.

SUMMARY

An embodiment of method for service level management comprisesidentifying connected enterprise application components and, undercontrol of an automated system, relating historical performance for theconnected enterprise application components and electronically creatinga service level agreement based on the historical performance relation.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention relating to both structure and method ofoperation may best be understood by referring to the followingdescription and accompanying drawings:

FIG. 1 is a schematic block diagram that illustrates an embodiment ofinfrastructure configured for behavioral modeling;

FIGS. 2A through 2E are schematic flow charts depicting embodiments ofservice level management methods;

FIG. 3 is a schematic block diagram that depicts an embodiment of aninfrastructure adapted for behavioral simulation;

FIGS. 4A through 4C are schematic flow charts showing embodiments of aservice level management method that perform a business infrastructuresimulation;

FIG. 4D is a graph that describes a general method for integration intoa priority model;

FIG. 5 is a schematic block diagram illustrating an embodiment of abusiness services model that operates in the manner of a “Sim-City” typesimulation for an information technology (IT) application;

FIGS. 6A, 6B, and 6C are schematic flow charts depicting embodiments ofa service level management method that handle business services based onseveral considerations;

FIG. 7 is a screen shot sequence showing an embodiment of a value centerimplementation; and

FIG. 8 is a schematic flow chart illustrating an embodiment of a servicelevel management method that can be used as a decision center for anautomated service level agreement system (ASLAS).

DETAILED DESCRIPTION

An Automated Service Level Agreement System (ASLAS) automates creationand maintenance of service level agreements (SLAs). Service LevelAgreements (SLAs) are useful for managing an information technology (IT)portfolio and for establishing the portfolio quality-of-service (QOS)delivered to IT customers. An illustrative ASLAS automates discovery ofperformance baselines and creation, maintenance, and tracking of SLAs.

Automated Service Level Agreement System (ASLAS) automates creation andmaintenance of service level agreements (SLAs) to effectively balanceservice level priority, quality of service delivered, IT cost, and otheraspects of business service handling.

The ASLAS can implement a value center concept that enables optimizationof service level management.

An illustrative Automated Service Level Agreement System (ASLAS)comprises multiple various enterprise application component parts thatare combined into an integrated form. In an initial operation, the ASLAScan enable a user to establish a service catalog if the catalog does notyet exist. In some embodiments and conditions, an application mappingtool may be employed to discover connected enterprise applicationcomponents including networks, servers, storage, and the like. Once theinformation is collected, business impact analysis can be performed forsome or all business services and entered into the ASLAS to performactions including service level balancing and staff loading orscheduling.

The ASLAS analysis and management actions can be based on the particularbusiness service objects in the system and historically-derivedperformance. Historically-derived performance, in one example, may betracked by relating the number of occurrences of service performancefailure or degradation that results in down-time during a month to thetotal available time during the month. In conditions that a servicedependency map exists, the map can be used with a record of aConfiguration item (CI) to determine the affect of component performanceon service delivery.

The ASLAS uses historical measures to model service level performanceand to model an expected level of performance for a future period. Abusiness service can be analyzed on the basis of percentageavailability, a response time distribution, and a resolution timedistribution during a measured time period. The business serviceanalysis enables determination of overall business service performanceincluding, for example, expected availability, expected incident rate,expected response, and expected resolution time. One deficiency oftraditional business service analysis is an inability to modelhistorical performance of an entire service portfolio and then determinethe affects of changing decisions about service levels and staffing.

Referring to FIG. 1, a schematic block diagram illustrates an embodimentof infrastructure 100 configured for behavioral modeling. Theinfrastructure 100 enables asset classes of like behavior to bedetermined, thus enabling prediction of the business serviceperformance. Enterprise application components depicted in theinfrastructure 100 has computer systems 102 including UNIX/Mainframesystems 104U, 104M and Windows/Mac systems 106. The Windows/Mac systems106 further can be classified as shared servers 106S and personalcomputers 106P that are further classified into desktops 106D andlaptops 106L.

FIG. 1 illustrates operation of ASLAS to effectively develop behavioralmodels for each asset class used for a business service or a supportedapplication. ASLAS aggregates performance across each asset class withina configuration management database (CMDB). Performance measurementsinclude availability, incident rate, response times, incident resolvetimes, and possibly other criteria. With performance information,software can establish an expected level of performance in futureoperation by aggregating expected asset class performance for some orall components of a business service or delivered application.

In the illustrative system, component performance data can be derived byinterrelating business service object components with historicalperformance level data stored in Service Level Management software orhistorical repositories. Once extracted, the component performance datacan then be used to produce a virtual or derived Service Level Agreement(SLA). The ASLAS enables aggregate analysis of multiple enterprise levelcomponents that can be highly variable in characteristics, and furtherincludes consideration of labor pool and labor pool mix, for forming theSLA.

Referring to FIG. 2A, a schematic flow chart depicts an embodiment of aservice level management method 200. The method 200 comprisesidentifying 202 connected enterprise application components and, undercontrol of an automated system 204, relating 206 historical performancefor the connected enterprise application components. The method furthercomprises electronically creating 208 a service level agreement based onthe historical performance relation.

In some embodiments, the method 200 may further comprise creating 203 aservice catalog designating enterprise application components. Based onthe service catalog, an automated service level management system useshistorical data to model define applications and automatically create aservice level agreement, either virtual or defacto, for businessservices.

The ASLAS uses the historical information to perform business impactanalysis and balance service levels, resources, service maintenancelevels, resources, service maintenance schedules, cost, and servicequality. In the process, the number of Service Level Managers to performmanagement services can be greatly reduced, shifting to a role focusedon acquiring services, service level planning, and negotiation of terms.The ASLAS enables allocation of resources according to businessimpact-ladened demand so that the method 200 has potential to improvebetter service quality while reducing overall staffing.

In addition to using the historical record and business impact rulesdefined by organization departments devoted to business services, impactof business performance can be analyzed and understood in terms of aspecific business unit, the business as a whole, or any subset ofbusiness units.

The service level management method 200 improves performance andefficiency of SLM in part by automating the process for creating servicelevel agreements from creation to deployment. The process automatescollection of historical data and information inputting to the system,as well as analysis and deployment of SLAs. In contrast, a conventionalSLM system has service level managers develop a service levelnegotiating point for every business service.

Referring to FIG. 2B, a service level management method 210 can furthercomprise storing 212 historical performance level data in a storage suchas a computer memory, long-term storage devices such as disks or tapes,network storage, and the like. The method 210 can further includeinterrelating 214 business service object components with the storedhistorical performance level data. Business service componentperformance can be electronically derived 216 from the interrelatedbusiness service object components and the historical performance leveldata.

In some implementations, the derived business service componentperformance can be integrated 218 into a historically-based servicebaseline. The historically-based service baseline can be combined 219into the historical performance level data.

In a particular embodiment, data from business customers regardingservice impact and opportunity cost can be collected 220 for a changedcomponent performance condition. The method 210 can further compriseanalyzing 221 a series of historically-based service baselines andopportunity cost data. An optimum service level agreement baseline canbe derived 222 from the analysis.

Some embodiments can implement actions including collecting 223 datafrom business customers regarding historical component outage andchanged component performance condition. Historical component outage andsystem performance variation to component performance can be related224.

Referring to FIG. 2C, a service level management method 230 can furthercomprise storing 231 historical performance level data in a storage ormemory and interrelating 232 business service object components with thestored historical performance level data. A designation of businessimpact can be assigned 233 to business services, associating 234 thebusiness impact with a plurality of different elements of an informationtechnology (IT) environment including business services, departments,operators, and work groups.

A unified metric for cost can be created 235 that is directlytranslatable to revenue from the assigned business impact. For example,a unified cost function can be specified 236 as a business impact thatdescribes value of delivery of a portfolio of services by informationtechnology to a business.

In some configurations, the method 230 can further compriseelectronically balancing 237 business impact aspects including servicelevel, resources, service maintenance schedules, cost, and quality ofservice. Virtual service level agreements can be created 238 based onthe balance.

Referring to FIG. 2D, a service level management method 240 can furthercollect information for relating information technology (IT)organizations to business services. The method 240 can comprisecollecting 241 information on information technology (IT) organizationsthat supply services to business services and collecting 242 informationon the business services that receive services from the ITorganizations. Relationships between the IT organizations and businessservices are established 243 and business impact rules at the ITorganization level for downtime and/or degraded performance areestablished 244.

In some embodiments, the method 240 can further comprise analyzing 245the established IT organization-business services relationships and theestablished business impact rules in combination with the historicalperformance. Historical performance impact is determined 246 based onthe analysis.

Referring to FIG. 2E, a service level management method 250 candetermine address work-around solutions to events or conditions. Impactof downtime and/or degraded performance can be identified 252 at awork-around level. An auditable impact trail can be created 254 based onthe identified impact.

Referring to FIG. 3, a schematic block diagram depicts an embodiment ofan infrastructure 300 adapted for behavioral simulation. Infrastructureclass models can be used to simulate future behavior. The infrastructure300 includes an incident behavior model 302 and a resolution behaviormodel 304. A data storage 306A supplies information such as time of day,a change history, capacity information, a maintenance history, userskills, and the like to the incident behavior model 302. A data storage306B supplies information including asset class weights, historicincident data, industry behavior, configuration compliance, and othersto the incident behavior model 302 and the resolution behavior model304. A data storage 306C supplies information including problem type,relative priority, worker skills, user responsiveness, and the like tothe resolution behavior model 304. Information from the incidentbehavior model 302 and the resolution behavior model 304 is applied to awork queue simulation 308 along with feedback 310 including servicelevel objectives and staffing levels. Results of the simulation aretransmitted from the work queue simulation 308 to a store function 312that stores service outage information. An optimization 314 can be runto determine service level objectives and staffing levels and applied asfeedback 310.

The chart depicted in FIG. 3 shows how service is modeled in terms ofvarious events and conditions such as incidents, costs, and risk. Byaggregating the service models, global tradeoffs and decisions can bemade across the business services portfolio, enabling resource andservice decisions to be made across the service portfolio. Modelingenables agreements to be made concerning various aspects of businessservices including the labor pool and expected requirements. Byembedding realistic agreements into service management software,performance can be driven to items of greatest business value.

In a next level of operation following modeling 302 and 304, a ServiceLevel Manager collects input information from business customersregarding the costs/impact/opportunity cost for downtime or systemperformance change for a business service. The ASLAS takes intoconsideration the information from business customers in combinationwith accumulated historical service data. As a result of the analysis,the ASLAS can trade off historical system service quality againstopportunity cost in an operational research model. The ASLAS canautomatically derive Service Level Agreements (SLAs) across anInformation Technology (IT) portfolio by calculating interrelatingservice opportunity cost to available IT investment.

The system optimizes the distribution of service level objectives byrelating requirements and labor to downtime impacts for each businessservice. An insert pictorial diagram in FIG. 3 shows an embodiment of anoptimization process. A plurality of business services 320 are shown incombination with lines 322 depicting the expected performance for abusiness service. Incidents information is queued in an incidence queue324 and activates a skill group 326 to adjust performance for theservice. The output determination of the optimization can be expressedas IT cost versus risk change. Service changes are shown as a responseto a change in expected results or in business service knobs 324.

Referring to FIG. 4A, a schematic flow chart depicts an embodiment of aservice level management method 400 that performs a businessinfrastructure simulation. The service level management method 400 isperformed according to acts comprising establishing 401 an informationtechnology (IT) environment model whereby entities includingdepartments, business services, configuration items, work groups, and/oroperators respond to a system input signal that can be used to simulatecollective IT behavior. A simulation is activated 402 based on an eventselected from among a plurality of predetermined incidents and/orpredetermined system modifications. The method further comprisessimulating 403 performance to an incident and/or system modificationaffecting a business service according to the established businessimpact rules producing a response according to the established ITenvironmental model and simulating 404 changes to the IT environmentalmodel to validate expected changes to performance to the changes in anactual IT environment. The method 400 can also enable updating ofbusiness rules implemented in the simulation by displaying 405 rulesthat trigger the simulated incident and auditing 406 the rules, ifdesired.

In various embodiments, different techniques can be used to update thesimulation model. For example, a profile of a preselected operator canbe established 408 that estimates the operator's capability to processevents in terms of success, quality, and time in context of theestablished IT environment model. The operator can be reassigned 409based on the profiled capabilities.

In other embodiments, the method 400 may comprise predicting 411 changesto intensity and type of incident to determine expected incidentresponse time. The IT environment can be proactively modified 412 inresponse to the changes.

In various embodiments, events such as incidents and imposed businessservices modifications can be handled in different manners. In someembodiments, incidents can be classified 420 according to skills-basedresolutions behavior. The method 400 can further comprise determining421 behavior according to the classification for expectedpaths-to-resolution and expected processing time.

In some embodiments, a requested business services modification can bethe addition 430 of a new business service. The method 400 can furthercomprise estimating 431 cost and business impact for adding a newbusiness service to the established IT environment and selecting 432business services according to similarity and scaling of the selectedbusiness services, enabling estimation 433 of the impact of adding thenew business service.

Referring to FIG. 4B, a schematic flow chart depicts another embodimentof a service level management method 440 that includes businessinfrastructure simulation. The method 440 comprises collecting 441performance information for a service portfolio and modeling 442historical performance of the service portfolio based on the collectedinformation. A selected outage incident and/or a selected modificationin service level decisions can be simulated 443 to determine performanceimpact across business services.

Referring to FIG. 4C, a schematic flow chart depicts another embodimentof a service level management method 450 that implements othersimulation aspects. The method 450 comprises simulating 452 performanceto incidents and/or system modifications affecting a business serviceaccording to established business impact rules. A priority model isdeveloped 454 that specifies relative priority of an incident relativeto other processed incidents according to the simulation and servicelevel objectives and business impact information is integrated 456 intothe priority model. The priority model is deployed 458 into a servicemanagement environment and service level objectives and staffing changesare deployed 460 into the service management environment.

Referring to FIG. 4D, a graph describes a general method for integrationinto a priority model. Priority level is plotted against time,specifically service downtime. An incident ticket is opened at time T₁.Noted points on the time line include time to service level organization(SLO) response time breach T_(RT) and time to service level organization(SLO) service availability breach T_(SA). The graph show priorityattributable to response time P_(RT) and priority attributable toservice availability P_(SA), as well as integrated priority P_(I) whichcombines priority attributable to response time P_(RT) and priorityattributable to service availability P_(SA). The graph also shows directbusiness impact I_(B).

Referring to FIG. 5, a schematic block diagram illustrates an embodimentof a business services model 500 that operates in the manner of a“Sim-City” type simulation for an information technology (IT)application. Input parameters to the simulation can be grouped asenvironmental choices 502, staffing choices 504, and service objectives506, although other parameter groups may be implemented in otherapplications. Other constraints applied to the simulation includeenvironmental facts and/or constraints 508. The input parameters andenvironmental facts and/or constraints 508 are applied to make updistributions including an incident distribution 510 and a resolutiondistribution 512. The service simulation 514 executes based on theincident distribution 510 and the resolution distribution 512, and fromstaffing choices 504 and the service objectives 506. IT cost 516 isdetermined by a calculation that derives costs from environment factsand/or constraints 508, environmental choices 502, and staffing choices504. IT risk 518 is determined by a calculation based on simulationresults, business service relationships 520, and business value 522. ITvalue 524 is determined by a simulation as a function of IT cost 516 andIT risk 518.

The business services model 500 that is operational as part of ASLAS,rather than having human intervention to manually calculate a servicelevel baseline, uses application mapping with historical levelperformance, for example as configured in the environmental facts and/orconstraints 508, to calculate the service level baseline. With baselinesestablished, the ASLAS uses opportunity cost to recommendobjective-based service level agreements (SLAs), creating virtual SLAs.

Referring to FIG. 6A, a schematic flow chart depicts an embodiment of aservice level management method 600 that organizes operators into workgroups. The method 600 comprises identifying 602 a collection ofoperators as a work group, assigning 604 incidents to work groups basedon work group traffic and work group efficacy for dynamicallyaccommodating changes in loading, and modeling 606 behavior ofindividual operators in the work group and behavior of the collectivework group. Modeling 606 of the behavior can further comprise modeling607 of processing elasticity of the individual operators and/or workgroup with increasing demand of event processing. The method 600 canfurther comprise estimating 608 effective work time of resources for awork group whereby staff loading can be expressed as a full-timeequivalent and estimating 610 the staff loading using historic data.Events assigned to the work group are processed 612 according to themodeling.

Referring to FIG. 6B, a schematic flow chart depicts an embodiment of aservice level management method 620 that managers services according tobusiness impact. The method 620 comprises assigning 622 business impactto services and specifying 624 a unified cost function as a businessimpact that describes value of delivery of a portfolio of services byinformation technology to a business. Configuration of informationtechnology (IT) components can be optimized 626 using the unified costfunction whereby business impact within defined constraints isminimized. Assignment 628 of operators to work groups and staffing level630 across the IT components can be determined according to theoptimization. The method 620 further comprises specifying 632 servicelevel objectives for association with business services and selecting634 IT infrastructure according to the optimization, and determining 636relative merit of modification to staffing, service level objectives,and/or infrastructure.

In an embodiment, optimization may be initialized by equation. Forillustrative purposes, optimization for staffing problem statements isshown although a similar optimization can be used for various aspects ofa system. For an arbitrary time interval, let L={I_(S) ₁ , . . . ,I_(S)_(M) } be loading levels for a set of M assignment groups S={S₁, . . .,S_(M)} and let A={a_(b) ₁ , . . . ,a_(b) _(N) } be the availabilityassociated with N business services B={b₁, . . . ,b_(N)}. Theavailability a_(b) ₁ (L) of a business service is a function ofassignment group loading. The business impact for a business service isa function of availability I_(b) ₁ (a_(b) ₁ (L)). To minimize businessimpact, business service impact and staffing are taken intoconsideration according to equation (1):

$\begin{matrix}{\min\left\lbrack {{\sum\limits_{i}\; {I_{b_{i}}\left( {a_{b_{i}}(L)} \right)}} + {\sum\limits_{j}\; {\chi^{l}s_{j}}}} \right\rbrack} & (1)\end{matrix}$

where χ is the burdened cost of a full-time equivalent employee (FTE),subject to constraints a_(b) _(i) (L)>a_(b) _(i) , l_(s) ₁ >λ_(S) ₁ ,and

$\Lambda_{\min} < {\sum\limits_{j}\; l_{S_{j}}} < {\Lambda_{\max}.}$

Referring to FIG. 6C, a schematic flow chart depicts an embodiment of aservice level management method 640 that handles services according toincident analysis. The method 640 comprises generating 642 an incidentfor a business service based on expected failure, expected performancedeterioration, actual failure, or actual performance deterioration, andclassifying 644 incidents. Optimal routing of incidents to assignmentgroups is estimated 646 according to the incident classification andincident trajectories in simulation are reassigned 648 based on bestrouting estimates to determine business impact of optimal incidentassignment.

Referring to FIG. 7, a screen shot sequence depicts an embodiment of avalue center 700 implementation. Some embodiments may be configured forimproved quantification of the impact of downtime and performancedegradation. A problem with conventional analysis, which typically ismanual, is that the impact of downtime in continuity analysis is oftenestimated, for example at the business service level of abstraction, andmay be highly inaccurate.

Value centers 700 enable capture of the impact of downtime or degradedperformance at the level of the work around. For example, an EnterpriseResource Planning (ERP) system that ceases operation or becomes disabledwill have a different impact on order entry, accounts payable, orfinancial statement creation than an operational system. The valuecenter concept enables creation of a relationship between departments sothat each department can create auditable impact rules for downtime orimpact at specific time frames or for a selected time duration.

The value center 700 can be used to create a much more precise andauditable impact trail and can be used to perform several additionalfunctions. First, the value center 700 can be used to aggregate impactaffects in simulation from the business service to associated componentcustomer users.

Second, by running historical performance across value center impactrules, the value center 700 enables abstraction of service levelavailability into the dollars and cents impact of service delivery, andthen enables analysis of the abstracted information for areas wherestaffing or hardware architecture can limit business performance. Forexample, a business service may be scheduled to cease operation, such asfor maintenance, at a specified time and for a specified duration, butotherwise is fully operational. Value center analysis can determinethat, if impact is high a high performing service can limit businessperformance.

Third, a value center can aggregate the impact of value centers intobusiness services and then test a simulated data center outage, enablingan outage to be related to a business unit and associated services. Thevalue center further enables creation of continuity plans that arerealistic and well-designed.

Referring to FIG. 8, a schematic flow chart illustrates an embodiment ofa service level management method 800 that can be used as a decisioncenter for an automated service level agreement system (ASLAS). Theservice level management method comprising analyzes 802 a service levelorganization. The analysis 802 further comprises executing aretrospective analysis 804, an active analysis 806, and a proactiveanalysis 808. The retrospective analysis 804 is an analysis of historicperformance that deconstructs the historic performance according tohistoric actual functioning of information technology (IT) businessservices staff and service level organization deployment. The activeanalysis 806 inspects IT business services current performance. Theproactive analysis 808 can be a simulation of IT business servicesperformance including proactive entry of events and/or change conditionsfor staff and service level organization deployment simulation.

In an illustrative embodiment, retrospective analysis execution 804 canfurther comprise executing 810 a comprehensive restructuring of captureddata for analysis of historic performance and executing 812 at least oneOn-Line Analytical Processing (OLAP) and transactional analytics forhistoric performance characterization.

In some embodiments, the active analysis execution 806 can furthercomprise executing 814 at least one On-Line Analytical Processing (OLAP)and transactional analytics for current performance characterization,and defining 816 relationships between information technology (IT)business services and customers in an IT-centric perspective thatenables determination of impact of unavailable and/or non-performedbusiness services.

In some embodiments, executing 808 the proactive analysis can furthercomprise guided searching 818 to determine optimal efficiency level. Theguided search 818 takes into consideration one or more primary areas forsearch, for example business services, value centers, and assignmentgroups. The guided search 818 can also consider different approaches forsorting including business impact, differences in business impactbetween two scenarios such as a baseline scenario and a scenario withstaff cut by a selected percentage across assignment groups, differencesin business impact between two scenarios given a change or modification,and the like.

In an illustrative embodiment, the proactive analysis execution 808 canfurther comprise executing 820 a dynamically-configurable informationtechnology (IT) and business organization model that selectivelysimulates historic, current, and/or prospective performance, enablingexperimentation to determine potential performance before invokingorganizational changes. Proactive analysis 808 can further compriseexecuting 822 a guided search and optimization for recursive simulation,enabling optimal configuration by iterative simulation changes,determination of change vectors, and deploying simulation results into anew simulation model.

Traditionally, service level agreements (SLAs) are created throughactions of service personnel to manually define SLA parameters. Incontrast, the illustrative ASLAS automates SLA creation. The ASLAStypically uses software to calculate historic service levels andapplication mapping to determine which systems support uptime andservice performance, enabling elimination of much of the labor cost forSLA definition and creation. Additionally, the ASLAS can automaticallycalculate end-to-end uptime for a business services system throughconsideration of all components that affect uptime. The ASLAS can alsocalculate baseline SLAs by trading off delivered performance, relativelybusiness service importance, and recommended SLA baselines, usingtechniques that do not require human intervention or labor other thanmanual operations defined the service level maps and calculation ofdowntime impact.

Due to high labor intensive characteristics, a traditional manualService Level Management (SLM) implementation that operates across anextended enterprise can easily cost millions of dollars. Despite thehigh cost of traditional Service Level Management, operating without aService Level Agreement (SLA) is risky since costs of business servicesfailure or downtime can far exceed costs of SLM. Furthermore, without aSLA to define the business understanding between a business and aninformation technology (IT) provider, dispute and/or litigation costscan also be very high. Furthermore, disputes that result from lack of aSLA can divert personnel from proper business functions to handling andmanagement of the dispute, creating inefficiency and disabling the ITprovider from proper business functions. The ASLAS interrelates serviceperformance, opportunity cost, and available budget to enable an ITsupplier to better operate in a business-like manner and better realizeoperating factors such as cost and priority.

The various functions, processes, methods, and operations performed orexecuted by the system can be implemented as programs that areexecutable on various types of processors, controllers, centralprocessing units, microprocessors, digital signal processors, statemachines, programmable logic arrays, and the like. The programs can bestored on any computer-readable medium for use by or in connection withany computer-related system or method. A computer-readable medium is anelectronic, magnetic, optical, or other physical device or means thatcan contain or store a computer program for use by or in connection witha computer-related system, method, process, or procedure. Programs canbe embodied in a computer-readable medium for use by or in connectionwith an instruction execution system, device, component, element, orapparatus, such as a system based on a computer or processor, or othersystem that can fetch instructions from an instruction memory or storageof any appropriate type. A computer-readable medium can be anystructure, device, component, product, or other means that can store,communicate, propagate, or transport the program for use by or inconnection with the instruction execution system, apparatus, or device.

The illustrative block diagrams and flow charts depict process steps orblocks that may represent modules, segments, or portions of code thatinclude one or more executable instructions for implementing specificlogical functions or steps in the process. Although the particularexamples illustrate specific process steps or acts, many alternativeimplementations are possible and commonly made by simple design choice.Acts and steps may be executed in different order from the specificdescription herein, based on considerations of function, purpose,conformance to standard, legacy structure, and the like.

While the present disclosure describes various embodiments, theseembodiments are to be understood as illustrative and do not limit theclaim scope. Many variations, modifications, additions and improvementsof the described embodiments are possible. For example, those havingordinary skill in the art will readily implement the steps necessary toprovide the structures and methods disclosed herein, and will understandthat the process parameters, materials, and dimensions are given by wayof example only. The parameters, materials, and dimensions can be variedto achieve the desired structure as well as modifications, which arewithin the scope of the claims. Variations and modifications of theembodiments disclosed herein may also be made while remaining within thescope of the following claims.

1. A service level management method comprising: identifying connectedenterprise application components; under control of an automated system:relating historical performance for the connected enterprise applicationcomponents; and electronically creating a service level agreement basedon the historical performance relation.
 2. The method according to claim1 further comprising: creating a service catalog designating enterpriseapplication components.
 3. The method according to claim 1 furthercomprising: storing historical performance level data in a storage;interrelating business service object components with the storedhistorical performance level data; and electronically deriving businessservice component performance from the interrelated business serviceobject components and the historical performance level data.
 4. Themethod according to claim 3 further comprising: integrating the derivedbusiness service component performance into a historically-based servicebaseline; combining the historically-based service baseline into thehistorical performance level data.
 5. The method according to claim 4further comprising: collecting data from business customers regardingservice impact and opportunity cost for a changed component performancecondition; and analyzing a series of historically-based servicebaselines and opportunity cost data; and deriving an optimum servicelevel agreement baseline from the analysis.
 6. The method according toclaim 4 further comprising: collecting data from business customersregarding historical component outage and changed component performancecondition; and relating historical component outage and systemperformance variation to component performance.
 7. The method accordingto claim 1 further comprising: storing historical performance level datain a storage; interrelating business service object components with thestored historical performance level data; assigning business impact toservices; associating the business impact with a plurality of differentelements of an information technology (IT) environment includingbusiness services, departments, operators, and work groups; creating aunified metric for cost that is directly translatable to revenue fromthe assigned business impact; specifying a unified cost function as abusiness impact that describes value of delivery of a portfolio ofservices by information technology to a business; electronicallybalancing business impact aspects including service level, resources,service maintenance schedules, cost, and quality of service; andcreating virtual service level agreements based on the balancing.
 8. Themethod according to claim 1 further comprising: collecting informationon information technology (IT) organizations that supply services tobusiness services; collecting information on the business services thatreceive services from the IT organizations; establishing relationshipsbetween the IT organizations and business services; and establishingbusiness impact rules at the IT organization level for downtime and/ordegraded performance.
 9. The method according to claim 8 furthercomprising: analyzing the established IT organization-business servicesrelationships and the established business impact rules in combinationwith the historical performance; and determining a historicalperformance impact based on the analysis.
 10. The method according toclaim 8 further comprising: establishing an information technology (IT)environment model whereby entities including departments, businessservices, configuration items, work groups, and/or operators respond toa system input signal that can be used to simulate collective ITbehavior; activating a simulation based on an event selected from amonga plurality of predetermined incidents and/or predetermined systemmodifications; simulating performance to an incident and/or systemmodification affecting a business service according to the establishedbusiness impact rules producing a response according to the establishedIT environmental model; simulating changes to the IT environmental modelto validate expected changes to performance to the changes in an actualIT environment; displaying rules that trigger the simulated incident;and auditing the rules.
 11. The method according to claim 10 furthercomprising: establishing a profile of a preselected operator thatestimates the operator's capability to process events in terms ofsuccess, quality, and time in context of the established IT environmentmodel; and reassigning the operator based on the profiled capabilities.12. The method according to claim 10 further comprising: predictingchanges to intensity and type of incident to determine expected incidentresponse time and proactively modify the IT environment in response tothe changes.
 13. The method according to claim 10 further comprising:classifying incidents according to skills-based resolutions behavior;and determining behavior according to the classification for expectedpaths-to-resolution and expected processing time.
 14. The methodaccording to claim 10 further comprising: estimating cost and businessimpact for adding a new business service to the established ITenvironment; and selecting business services according to similarity andscaling the selected business services to estimate impact of adding thenew business service.
 15. The method according to claim 1 furthercomprising: collecting performance information for a service portfolio;modeling historical performance of the service portfolio based on thecollected information; and simulating a selected outage incident and/ora selected modification in service level decisions for performanceimpact across business services.
 16. The method according to claim 1further comprising: identifying a collection of operators as a workgroup; assigning incidents to work groups based on work group trafficand work group efficacy for dynamically accommodating changes inloading; modeling behavior of individual operators in the work group andbehavior of the collective work group, the modeled behavior furthercomprising modeling of processing elasticity of the individual operatorsand/or work group with increasing demand of event processing; estimatingeffective work time of resources for a work group whereby staff loadingcan be expressed as a full-time equivalent; estimating the staff loadingusing historic data; and processing events assigned to the work groupaccording to the modeling.
 17. The method according to claim 1 furthercomprising: identifying impact of downtime and/or degraded performanceat a work-around level; and creating an auditable impact trail based onthe identified impact.
 18. The method according to claim 1 furthercomprising: assigning business impact to services; specifying a unifiedcost function as a business impact that describes value of delivery of aportfolio of services by information technology to a business;optimizing configuration of information technology (IT) components usingthe unified cost function whereby business impact within definedconstraints is minimized; determining assignment of operators to workgroups according to the optimization; determining staffing level acrossthe IT components according to the optimization; specifying servicelevel objectives for association with business services according to theoptimization; selecting IT infrastructure according to the optimization;and determining relative merit of modification to staffing, servicelevel objectives, and/or infrastructure.
 19. The method according toclaim 1 further comprising: simulating performance to incidents and/orsystem modifications affecting a business service according toestablished business impact rules; developing a priority model thatspecifies relative priority of an incident relative to other processedincidents according to the simulation; integrating service levelobjectives and business impact information into the priority model;deploying the priority model into a service management environment; anddeploying service level objectives and staffing changes into the servicemanagement environment.
 20. The method according to claim 1 furthercomprising: generating an incident for a business service based onexpected failure, expected performance deterioration, actual failure, oractual performance deterioration classifying incidents; estimatingoptimal routing of incidents to assignment groups according to theincident classification; and reassigning incident trajectories insimulation based on best routing estimates to determine business impactof optimal incident assignment.
 21. A service level management methodcomprising: analyzing a service level organization comprising: executinga retrospective analysis of historic performance that deconstructs thehistoric performance according to historic actual functioning ofinformation technology (IT) business services staff and service levelorganization deployment; executing an active analysis of IT businessservices current performance; and executing a proactive analysis ofsimulated IT business services performance including proactive entry ofevents and/or change conditions for staff and service level organizationdeployment simulation.
 22. The method according to claim 21 comprising:executing the retrospective analysis further comprising: executing acomprehensive restructuring of captured data for analysis of historicperformance; and executing at least one On-Line Analytical Processing(OLAP) and transactional analytics for historic performancecharacterization.
 23. The method according to claim 21 comprising:executing the active analysis further comprising: executing at least oneOn-Line Analytical Processing (OLAP) and transactional analytics forcurrent performance characterization; defining relationships betweeninformation technology (IT) business services and customers in anIT-centric perspective that enables determination of impact ofunavailable and/or non-performed business services.
 24. The methodaccording to claim 21 comprising: executing the proactive analysisfurther comprising guided searching to determine optimal efficiencylevel.
 25. The method according to claim 21 comprising: executing theproactive analysis further comprising: executing adynamically-configurable information technology (IT) and businessorganization model that selectively simulates historic, current, and/orprospective performance, enabling experimentation to determine potentialperformance before invoking organizational changes; and executing aguided search and optimization for recursive simulation, enablingoptimal configuration by iterative simulation changes, determination ofchange vectors, and deploying simulation results into a new simulationmodel.